A technical overview of Qwen3‑TTS from Alibaba’s Qwen team: one‑line pip install, 3‑second voice cloning, natural‑language voice design, and support for 10 languages including Japanese. Apache 2.0 licensed.
Overview of PersonaPlex‑7B‑v1 released by NVIDIA in January 2026. A Moshi‑based voice dialog model that enables full‑duplex conversation and persona control.
The Web Speech API + Gemini + VOICEVOX setup is complete — an AI character you can actually have a voice conversation with. Key implementation notes and impressions.